Biological data cleaning: a case study
Abstract
As databases become more pervasive throughout the biological sciences, various data quality concerns are emerging. Biological databases tend to develop data quality issues regarding data legacy, data uniformity and data duplication. Due to the nature of this data, each of these problems is non-trivial and can cause further problems for the database. For biological data to be corrected and standardised, methods and frameworks must be developed to handle both structural and traditional data. This paper discusses issues concerning biological data quality with respect to data cleaning. It presents BIO-AJAX, a framework developed to address these issues. Finally, it describes BIO-AJAX for TreeBASE and BIO-AJAX for Lineage Path, two implementations of BIO-AJAX on phylogenetic data sets.
Similar resources
Building a Disordered Protein Database: A Case Study in Managing Biological Data
A huge diversity of biological databases is available via the Internet, but many of these databases have been developed in an ad hoc manner rather than in accordance with any data management principles. In addition, in the area of disordered protein databases, many of the databases have not been made publicly available. This poses challenges to researchers, since reliable protein databases are ...
Industrial Cleaning with ultra-clean water according to the Qlean-method – a case study of printed circuit boards
The manufacturing industry today uses many kinds of chemicals in its cleaning processes. The industrial cleaners often contain some sort of degreasing chemical to clean parts and components before the main processes, for instance assembly or surface treatment. These types of cleaning methods are often expensive and involve hazardous handling of chemicals in manufacturing, as well as in the tran...
SQLShare: Scientific Workflow via Relational View Sharing
We consider a case study in using a web-based query-as-a-service platform as an alternative to script-based scientific workflows. The context is a project in observational biological oceanography to share and process data from a ship-based continuous profiler of microbial populations called SeaFlow. The representative tasks involve aggregating and cleaning SeaFlow measurements, integrating the c...
Industrial cleaning with Qlean Water: a case study of printed circuit boards
Many manufacturing companies are looking for ways to substitute environmentally problematic cleaning methods for surface treatments with more environmentally friendly ones. In this paper, one potential solution is described. The Qlean method, based on cleaning with highly pure water (in this paper defined as Qlean Water), is a novel cleaning method. This method, now utilized at one plant at a l...
Journal: IJIQ
Volume 1, Issue
Pages -
Publication date: 2007